Multi-objective Bandits: Optimizing the Generalized Gini Index
نویسندگان
چکیده
We study the multi-armed bandit (MAB) problem where the agent receives a vectorial feedback that encodes many possibly competing objectives to be optimized. The goal of the agent is to find a policy, which can optimize these objectives simultaneously in a fair way. This multi-objective online optimization problem is formalized by using the Generalized Gini Index (GGI) aggregation function. We propose an online gradient descent algorithm which exploits the convexity of the GGI aggregation function, and controls the exploration in a careful way achieving a distribution-free regret Õ(T 1/2) with high probability. We test our algorithm on synthetic data as well as on an electric battery control problem where the goal is to trade off the use of the different cells of a battery in order to balance their respective degradation rates.
منابع مشابه
A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements
We generalise classical multi-armed and restless bandits to allow for the distribution of a (fixed amount of a) divisible resource among the constituent bandits at each decision point. Bandit activation consumes amounts of the available resource which may vary by bandit and state. Any collection of bandits may be activated at any decision epoch provided they do not consume more resource than is...
متن کاملA Note on Bandits with a Twist
A variant of the multi-armed bandit problem was recently introduced by Dimitriu, Tetali and Winkler. For this model (and a mild generalization) we propose faster algorithms to compute the Gittins index. The indexability of such models follows from earlier work of Nash on generalized bandits.
متن کاملEstimating Quality in User-Guided Multi-Objective Bandits Optimization
Many real-world applications are characterized by a number of conflicting performance measures. As optimizing in a multi-objective setting leads to a set of non-dominated solutions, a preference function is required for selecting the solution with the appropriate trade-off between the objectives. This preference function is often unknown, especially when it comes from an expert human user. Howe...
متن کاملThe Generalized Gini index and the measurement of income mobility
Two new normative indices of mobility are proposed. The first one is a population weighted generalized Gini mobility index and will be higher, the higher the size of the transfer between two individuals and, for a given transfer, the higher the rank difference between the individuals between whom the transfer takes place. This index is also higher, the greater the rank gap between the individua...
متن کاملGeneralized Gin1 Inequality Indices
When incomes are ranked in descending order the social-evaluation function corresponding to the Gini relative inequality index can be written as a linear function with the weights being the odd numbers in increasing order. We generalize this function by allowing the weights to be an arbitrary non-decreasing sequence of numbers. This results in a class of generalized Gini relative inequality ind...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017